Adaptive Combination of Behaviors in an Agent

Authors

  • Olivier Buffet
  • Alain Dutech
  • François Charpillet
Abstract

Agents are of interest mainly when confronted with complex tasks. We propose a methodology for the automated design of such agents (in the framework of Markov Decision Processes) in the case where the global task can be decomposed into simpler, possibly concurrent, sub-tasks. This is accomplished by automatically combining basic behaviors using Reinforcement Learning methods. Basic behaviors can be either learned or reused from previous tasks, since they do not need to be tuned to the new task. Furthermore, the agents designed by our methodology are highly scalable: without further refinement of the global behavior, they can automatically combine several instances of the same basic behavior to take into account concurrent occurrences of the same sub-task.
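
The abstract does not say how basic behaviors are combined, so the following is only an illustrative sketch under stated assumptions: each behavior instance is assumed to expose one score per action (for example a Q-value), and the combination is assumed to be a weighted sum followed by a greedy choice. The names `combine_behaviors` and `greedy_action`, the behavior names, and all numbers are hypothetical. The weights are fixed by hand here, whereas in the setting described above they would be adapted by reinforcement learning.

```python
def combine_behaviors(scores_per_instance, weights):
    """Weighted sum of per-action scores over all active behavior instances.

    Assumption: every instance scores the same action set, in the same order.
    """
    n_actions = len(scores_per_instance[0])
    combined = [0.0] * n_actions
    for scores, weight in zip(scores_per_instance, weights):
        for a in range(n_actions):
            combined[a] += weight * scores[a]
    return combined


def greedy_action(scores):
    """Index of the highest-scoring action."""
    return max(range(len(scores)), key=lambda a: scores[a])


# Two concurrent instances of one hypothetical basic behavior ("avoid"),
# plus one instance of another ("push"). Adding a third "avoid" instance
# would only append one more entry to each list.
avoid_1 = [0.0, -1.0, 0.2]
avoid_2 = [0.1, 0.3, -0.8]
push = [0.5, 0.2, 0.4]

scores = combine_behaviors([avoid_1, avoid_2, push], [1.0, 1.0, 2.0])
action = greedy_action(scores)
```

Because the combination only iterates over whatever instances are present, handling several concurrent occurrences of the same sub-task requires no change to the combining code, which is the scalability property the abstract points to.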





Publication date: 2002